Semantic Correspondence of Database Schema from Heterogeneous Databases using Self-Organizing Map
نویسندگان
چکیده
This paper provides a framework for semantic correspondence of heterogeneous databases using selforganizing map. It solves the problem of overlapping between different databases due to their different schemas. Clustering technique using self-organizing maps (SOM) is tested and evaluated to assess its performance when using different kinds of data. Preprocessing of database is performed prior to clustering using edit distance algorithm, principal component analysis (PCA), and normalization function to identify the features necessary for clustering. Keyword: Semantic integration, heterogeneous databases, semantic correspondence, clustering, data pre-processing, self-organizing maps.
منابع مشابه
Clustering Schema Elements for Semantic Integration of Heterogeneous Data Sources
Interschema relationship identification (IRI), that is, determining the relationships among schema elements in heterogeneous data sources, is an important step in integrating the data sources. This article proposes a cluster analysis based approach to semi-automating the IRI process, which is typically very time-consuming and requires extensive human interaction. The authors apply multiple clus...
متن کاملClustering Similar Schema Elements Across Heterogeneous Databases: A First Step in Database Integration
Interschema relationship identification (IRI), that is, determining the relationships among schema elements in heterogeneous data sources, is an important first step in integrating the data sources. This chapter proposes a cluster analysis-based approach to semi-automating the IRI process, which is typically very time-consuming and requires extensive human interaction. We apply multiple cluster...
متن کاملSemantic Mediation in the National Geologic Map Database (US)
Controlled language is the primary challenge in merging heterogeneous databases of geologic information. Each agency or organization produces databases with different schema, and different terminology for describing the objects within. In order to make some progress toward merging these databases using current technology, we have developed software and a workflow that allows for the “manual sem...
متن کاملSEMINT: A tool for identifying attribute correspondences in heterogeneous databases using neural networks
One step in interoperating among heterogeneous databases is semantic integration: Identifying relationships between attributes or classes in dierent database schemas. SEMantic INTegrator (SEMINT) is a tool based on neural networks to assist in identifying attribute correspondences in heterogeneous databases. SEMINT supports access to a variety of database systems and utilizes both schema infor...
متن کاملSemantic Concept Detection from News Videos with Self-Organizing Maps
In this paper, we consider the automatic identification of video shots that are relevant to a given semantic concept from large video databases. We apply a method of representing semantic concepts as class models on a set of parallel Self-Organizing Maps trained with multimodal low-level features. The presented experiments were conducted using a set of 170 hours of video containing recorded tel...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009